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Abstract 

In 1929 Lanczos showed how to derive Dirac's equation from a more 
fundamental system that predicted that spin | particles should come in pairs. 
Today, these pairs can unambiguously be interpreted as isospin doublets. 
From the same fundamental equation, Lanczos derived also the correct form 
of the wave equation of massive spin 1 particles that would be rediscovered 
in 1936 by Proca. 

Lanczos's fundamental system was put in Lagrangian form and general- 
ized in 1933 by Einstein and Mayer, who used the semivector instead of the 
quaternion formalism. Although they not did study all possible solutions, 
Einstein and Mayer showed that the doublets consisted of particles with 
different mass and charge. In fact, there are two main classes of doublets: 
proton/neutron and electron/neutrino pairs. 

In trying to use Proca's equation for the electromagnetic field of the 
electron, Lanczos reached the conclusion that the mass term could not be a 
constant, but had to be a function of space-time. On very general grounds, 
he then proposed that the elementary solutions of a fundamental nonlinear 
field theory should be eigensolutions. 

In complete independence of Lanczos, Gerard Petiau discovered in 1957 
a nonlinear generalization of quantum mechanics which is very close to 
Lanczos ideas. In such a theory, the Hamiltonian scales with the 4th power 
of the proper frequency. Postulating a fundamental length equal to ar e , with 
r e the classical radius of the electron, we find that the Lanczos-Einstein- 
Petiau model is applicable to the problem of the mass of the quarks and 
leptons. The compatibility of the existence of this fundamental length with 
quantum theory and special relativity limits the number of quarks to five and 
the number of massive leptons to three. 



*Published in W.R. Davis, et al., Cornelius Lanczos Collected Published Papers With Com- 
mentaries (North Carolina State University, Raleigh, 1998) Vol. Ill, p. 2-1248 to 2-1277. 
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Preliminary remarks 



This commentary was written in 1994 and published in 1998 in the Cornelius 
Lanczos Collected Published Papers With Commentaries 0ffl Vol. Ill, p. 2-1248 
to 2-1277]. The first part, Sees. 1 to 15, is a comprehensive appreciation of the 
papers Lanczos wrote in 1929 on Dirac's equation and relativistic quantum me- 
chanics i0 El El 0^° the second part, Sees. 15 to 17, is a short digression, inspired 
by Lanczos, on the possible relevance of more recent contributions by Petiau 
l\44\ \45\l and Barut M7\ 1491/ to the problem of the mass spectrum of elementary 
particles. 

This corrected and updated version has been posted on arXiv . org on 
August 4, 2005, as a tribute to Sir William Rowan Hamilton, on the occasion of 
his 200th birthday. 

* * * 



1 Introduction 

In 1929, Lanczos was with Einstein in Berlin. At last, he was working with the 
great man to whom, ten years before, he dedicated his dissertation: a quaternionic 
field theory of classical electrodynamics. But Lanczos's mind was not only on 
general relativity. Indeed, he was about to publish a series of three papers on 
Dirac's equation QElEll- 

The year before, Dirac had discovered his relativistic wave-equation for the 
electron. Because of its success in explaining both the electron spin and the 
fine structure of atomic energy levels, all physicists immediately recognized the 
paramount importance of Dirac's discovery. Moreover, the simplicity of the 
arguments leading to its derivation was at once taken as the proof of its absolute 
fundamental character, an opinion that still prevails today. Lanczos had a different 
interpretation. For him Dirac's equation was a derived one: he saw that Dirac's 
theory was a natural development of his quaternionic field theory of 1919 JH. 

With considerable assurance, and in his characteristic didactic style, Lanczos 
passionately puts down his derivation of Dirac's equation. He produces a very 
modern article, which over sixty years later still contains a number of ideas which 
remain at the forefront of fundamental theory. He displays his masterful command 
of quaternionic algebraic techniques and, with a very careful choice of notations 
and minimal use of redundant mathematical jargon, he ends up with a physics 
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paper so different in style from those of his time that one could think that he wrote 
it for the future. 

The exact reasons why these articles were ignored by the vast majority of his 
contemporaries, and the reasons why Lanczos himself abandoned quaternions and 
never returned to quaternionic field theory for the rest of his life, could be the 
subject of research for some historian of science. Already on October 25th, 1929, 
at the lecture he gave at the Physical Society of Berlin, he did not pronounce the 
word 'quaternion' 0. Later 0, in his famous 1931 paper where he discusses the 
fundamental issues raised by general relativity, quantum theory and the unsolved 
'problem of matter,' he returned to his interpretation of matter as singularities in 
an otherwise regular field, and once again compared this idea with the concept 
of eigensolutions which has proved to be so successful in quantum mechanics. 
However, he will not quote any of the four papers he wrote in 1929. 

Lanczos will briefly refer to his three quaternion articles of 1929 only twice, 
once in 1932, in his first article in English Q, and in 1933 in his new derivation 
of Dirac's equation [8|. In this derivation, Dirac's equation is found to be the 
equivalent of Hamilton's first order equations in a problem where a second order 
relativistic potential equation is in correspondence with the second order Euler- 
Lagrange equations. Moreover, as in 1929, Dirac's equation is not found in 
isolation. There is a doubling in the number of solutions, which from four in 
Dirac's theory (two for spin and two for particle/antiparticle) increases to eight, a 
feature that we can today interpret as isospin. 1 

The isospin partner of the proton — the neutron — was discovered in 1932, 
at a time when the existence of the neutrino was already postulated. Thus, one 
may wonder why nobody ever thought of using Lanczos's doubling to explain the 
existence of these particles. The fact is that history took a different course, and it 
now remains for a historian of science to explain why. 

Another mystery is that Lanczos discovered the correct equation for a massive 
spin one particle, including its conserved current and energy-momentum tensor, 
seven years before it was rediscovered by Proca in 1936. Moreover, the whole 
series of Lanczos's articles is a remarkable discussion of the fundamental problems 
of matter, fields, and the origin of mass, most of which is still pertinent today. In 
particular, his intuition led him to foresee the possibility of a nonlinear theory, 
exactly of the kind that Petiau will develop almost thirty years later, and which is 
likely to give an answer to problem of the mass spectrum of elementary particles. 

'However, in his second article of 1933 Lanczos finds no doubling of the Dirac equation. This 
is due to the fact that the four-potential is taken as real. Allowing this potential to become complex, 
two independent Dirac equations would be found again. 
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2 Three problems 



Of course, as in any new development, there were problems in Lanczos quater- 
nionic field theory, and Lanczos himself was very much aware of them. The sad 
thing is that essentially nobody, including some of the most influential physicists 
of the time, tried to follow and possibly improve on Lanczos's work. Among those 
rare contemporary physicists who took interest in Lanczos's approach, only three 
seem to have found it worthwhile to refer to Lanczos's quaternionic papers: Franz 
Sauter in Germany, Gustave Juvet in Switzerland, and Alexandre Proca in France. 
We will come back to them later. 

In short, with hindsight, we can easily find the main problems in Lanczos's 
papers. These problems are of three types. 

First, there are problems of physical interpretation because Lanczos's equation 
is much more general than Dirac's. For instance, apart from the difficulties due 
to the prediction of antiparticles, Dirac's equation was very successful because it 
perfectly suited electrons. The trouble with Lanczos's fundamental system (from 
which Dirac's equation can be derived as a special case) is that it allows for spin \ 
solutions (such as the electron), as well as for spin 0, 1 and |. 2 Obviously, just 
like anybody at the time, Lanczos was completely unaware of the richness of the 
elementary particles spectrum: he thought that his equation described electrons, 
and that the 'doubling' was possibly related to the existence of protons. 

The second problem has to do with spatial reversal. It seems that Lanczos was 
not familiar with the idea that covariance with respect to spatial reversal had also 
to be included in order to have full relativistic invariance. For this reason, a rather 
substantial part of all articles is devoted to giving indirect arguments to justify the 
suppression of unphysical terms (such as 'magnetic currents'), or to discussing 
complicated equations in which contributions of different parities are mixed. 

The third problem is more technical and is connected with the previous one. It 
has to do with the general question of how to write strictly covariant expressions 
in terms of quaternions, including spatial reversal. Given that Lanczos knew a lot 
about quaternions, in general, it follows that Lanczos would have appreciated the 
solution to this problem if someone had solved it at the time. Unfortunately, he 
had to do his best, going back and forth between quaternions and tensors, in order 
to circumvent the difficulty, the precise nature of which he was not really aware. 

Because of these problems, it is not easy to read and comment on Lanczos's 

2 In this paper we are not going to consider explicitly the spin | solutions. However, the main 
point is that since a spinor is composed of 25 + 1 components, one biquaternion is enough to 
describe a spin | spinor (S = §). See ll64l. 
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papers in detail. We have therefore decided to write our commentary rather 
independently of them, although we shall try to follow the order in which Lanczos 
introduced the main concepts as much as possible. Since Lanczos's papers are 
very dense and detailed, we shall concentrate on the more fundamental parts. 
Where required, we shall also correct some of his formulas on the basis of our 
hindsight. In effect, while some of his formulas or wording need updating, it is 
remarkable that Lanczos always remained extremely close to the truth as we know 
it today. Obviously, taking this approach we are at the risk of crediting Lanczos 
with discoveries that in actual fact he did not consciously make. 

3 Formulating and interpreting Dirac's equation 

In the introduction to his first paper d, Lanczos starts by recalling the two 
conditions which led Dirac to formulate his equation: it had to be a linear differ- 
ential system of first order, and the iterated equation had to give the 'relativistic 
Schrodinger wave-equation,' i.e., what we today call the 'Klein-Gordon equation.' 
The first condition is sufficient for having a deterministic evolution equation, and 
the second one insures the relativistic invariance of the result. Thus, with just two 
axioms, Dirac had a solution to the problem of the relativistic electron, which, 
as stressed by Lanczos, has the advantage of being amenable to general 'operator 
methods,' for which one does not have to worry about the difficulties of normal 
tensor analysis. 

Today, Dirac's equation is a cornerstone of physics. Just like Schrodinger's 
equation in the non-relativistic domain, it is the focal point around which the highly 
successful formalism of contemporary relativistic quantum electrodynamics is 
built. Hence, despite the fact that many big problems are still unsolved (i.e., mass, 
charge, infinities, etc.), it is almost heretical to discuss the possibility that Dirac's 
equation is not the most fundamental one. Moreover, it is futile to think that by 
recasting Dirac's theory in another formalism, one may get a serious clue for a 
better theory of the electron and other fundamental particles. 

In fact, Lanczos was not the first to worry about the algebraic structure and 
tensor-analytical relations of Dirac's equation. He quotes Madelung 3 [9| and Dar- 
win ifTDI but today we could cite many authors because this question has become 
the source of an endless stream of studies. In effect, while for the majority of 
physicists the answer would seem to have been the invention of 'spinor calcu- 

3 Erwin Madelung was the first to show that Dirac's equations are somehow equivalent to 
Maxwell's with feedback. Lanczos was Madelung's assistant in Frankfurt from 1924 until Fall 
1928 when he moved for one year to Berlin to become Einstein's first assistant. 
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lus' by van der Waerden, Laporte, and Uhlenbeck [fTTTl. or simply the pragmatic 
approach of using 'Dirac matrices,' for others the question is still open. In fact, 
the contemporary trend would seem to be a return to Lanczos, i.e., to the refor- 
mulation of Dirac 's equation in the framework of Clifford algebras lfT2ll of which 
quaternions are precisely the simplest non-trivial example. 

It is therefore interesting to note that those few who quoted Lanczos's work 
in 1930, were Sauter who was part of a number of young German physicists 
(mainly around Sommerfeld) using Clifford algebras to study Dirac's theory lfT3l . 
and Juvet 4 who was leading a group of Swiss physicists who attempted to phrase 
special relativity, Maxwell's and Dirac's theories, also in the framework of Clifford 
algebras [ 14J . But both efforts faded out, possibly because of the unfavorable times 
in Germany, and the unfortunate death of Juvet in Switzerland. Similar work on 
Dirac's theory took place in France where, independently, Proca 5 tried to introduce 
without success the Clifford formalism [15[. 6 The 'gamma-matrices' and spinor 
formalisms became therefore the standard notations for formulating Dirac's theory. 

But Lanczos's motivation was not to find the ideal formulation of Dirac's 
equation. He wanted to understand Dirac's theory in terms of ordinary tensor 
calculus, and, even more, to show its connections with his own previous work on 
Maxwell's theory. Indeed, Lanczos's starting point is the observation, traditionally 
attributed to Hermann Weyl [16], that Dirac's equation in the case of a vanishing 
rest mass is identical to Maxwell's equation in vacuum (provided Maxwell's field 
is allowed to become a four-component complex field). Since this equation has 
a straightforward tensor interpretation, and is particularly simple when expressed 
in the quaternion formalism, Lanczos immediately saw a new perspective for 
understanding Dirac's equation, and, in his own words, to discover a "via regia to 
grasp the internal substance of Dirac's equation" [1, p. 449]. 

4 Gustave Juvet (1896-1936), translated in French (in collaboration with R. Leroy) Hermann 
Weyl's book "Space, Time, Matter" (A. Blanchard, Paris, 1922). Juvet's PhD thesis, "Sur une 
equation aux derivees fonctionnelles partielles et une generalisation du theoreme de Jacobi" (Blan- 
chard, Paris, 1926) will be used by Paul Weiss in 1938 (Proc. Roy. Soc. A169 page 104). For a 
biography and bibliography, see Actes Soc. Helv. Sci. Nat. 117 (1936) 422-426. 

5 Alexandre Proca (1897-1955) translated in French (in collaboration with J. Ullmo) Dirac's 
"Principles of Quantum Mechanics'" (Press Universitaire de France, 1931). For a biography and 
bibliography of Proca, see G.A. Proca, "Alexandre Proca. Oeuvre scientifique publiee" (Georges 
A. Proca, Paris, 1988) ISBN-2-9502854-0-6. 

6 In these papers Proca follows J. A. Schouten, Proc. Royal Acad. Amsterdam 32 (1929) 105- 
108, in calling the elements of the Dirac-Clifford algebra 'quadri-quaternions.' Other independent 
attempts were made in England by A.S. Eddington, Proc. Roy. Soc. A 121 (1928) 524-542, and 
G. Temple, Proc. Roy. Soc. A 127 (1930) 339-348; in Russia by D. Iwanenko and K. Nikolski, 
Zeits. f. Phys. 63 (1930) 129-137; in Belgium by Th. de Donder and Y. Dupont, Bull. Acad. Roy. 
de Belg. CI. Sc. 16 (1930) 1092-1097, 18 (1932) 596-602, 19 (1933) 472-478, and 19 (1933) 
593-598; and possibly in other countries. 
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4 Biquaternions and the residual arbitrariness in the 
definition of the quaternion product 



Quaternion formulations in physics can be seen as the most simple holistic four- 
dimensional formalism, in which complicated physical variables are treated as 
single entities that are not just abstract symbols, but hypercomplex numbers which 
can be directly used for explicit calculations. In general, the quadruplets will 
consist of complex numbers, hence the name biquaternions. 7 For example, a point 
in Minkowski's space-time can be represented by the biquaternion X = [it,x). 
There are three fundamental involutions: quaternion conjugation, represented by 
a 'bar,' i.e., the operator (), which changes the sign of the vector part of the 
biquaternion; imaginary conjugation denoted by a star ()*; and biconjugation 
which is the combination of the two previous involutions: ( ) + = ( ) . 

With quaternions, physical formulas are generally very compact and there is 
kind of a mystery that we call Hamilton's conjecture ifPTl . as to why so many 
fundamental laws of physics can be neatly expressed in quaternion form. 8 But, 
as with any formalism, there are a number of specific problems that have to be 
carefully addressed in order to have a consistent set of working rules. This is 
particularly the case with discrete operations, such as imaginary conjugation or, 
as we have already mentioned, spatial reversal. To understand the origin of these 
problems, it is useful to digress and to return to Sir William Rowan Hamilton's 
Elements of Quaternions, and more precisely to a note added by Charles Jasper 
Joly in 1898, at the end of section ten lfT8l p. 162]. This will allow us to recall 
some definitions and fix our notation. 

Starting from the set of quadruplets of real or complex numbers, the quaternion 
algebra is obtained by requiring their product to be associative, and the division 
to be feasible always, except possibly in some singular cases. 9 Then, writing two 
quadruplets A and B as [a, a] and [b, b], and using contemporary vector notations, 
their product has the following explicit form 

[a, a] [ft, b] = [ab + p a ■ b, ab + ab + q a x b\. (1) 

The two constants p and q are related by the equation 

q 2 +p 3 = 0, (2) 

7 The algebras of real and complex quaternions are denoted by H and B, respectively. 
8 Ref. [ 17 1 can be useful as a concise introduction to quaternions and their use in physics. 
9 In H the division is always possible. In B, however, division by singular quaternions (also 
called null quaternions), for which SS = 0, is not possible. 
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which shows that there is some residual arbitrariness when defining the product of 
two quadruplets. For instance, taking p — —l,q can be equal to either +1 or —1. 
On the other hand, taking p = +1, q may be +i or —%. 

Obviously, in the present context, the purpose of quaternions is to provide 
an explicit representation of Minkowski's space-time. Taking four real numbers 
as our fundamental quadruplets, we can examine the consequences of specific 
choices of p and q in this context. 

The arbitrariness in the sign of p is connected with the signature of the metric. 
In effect, since the square of the norm of a quaternion A is by definition its product 
by its conjugated quaternion, we have \A\ 2 = AA = a 2 + pa 2 . Thus, as long as 
imaginary conjugation is not given a physical interpretation, the sign of p is in 
principle immaterial, because one can always multiply the vector or scalar part of 
all quaternions by % in order to get the desired signature. 

The arbitrariness in the sign of q is due to the non-commutativity of the 
quaternion product. Indeed, changing the order of the factors A and B is equivalent 
to changing the sign of q. We call the involution associated with the changing of 
this sign it ordinal conjugation 10 and designate this operation by the symbol ( )~. 
Obviously, since q is the sign associated with the vector product, there is a close 
connection between ordinal conjugation and space inversion. However, contrary 
to the case of p, there appears to be no overall covariant criterion to decide for 
the sign of q, or to compensate for an arbitrary choice. Therefore, in accord with 
the principle of relativity, one has to make sure that fundamental physical entities 
are 'ordinal covariant,' 11 i.e., that they do not arbitrarily depend on the sign of 
q. In particular, we will later show how to use ordinal conjugation to construct 
quaternion tensors of definite parities. 

This is the technical problem to which we were referring in Sec. 2. If Lanczos 
had its solution, he would have much more easily discovered Proca's equation. 

Returning to the choice of p, we see that p — +1 gives Hamilton's algebra, 
and p = — 1 the Pauli algebra. As is well known for real components, Hamiltons' 
algebra is a closed 4-element algebra, while Pauli's algebra is not. Since complex 
numbers are to be used as components in the general case, it may seem that the 
choice between Hamilton's and Pauli's algebra is without consequences. In fact, 
while the tradition of quantum mechanics has historically developed from Pauli's 
choice not to use Hamilton's quaternions for his non-relativistic theory of spin, 
but his own set of 'matrices' instead, 12 it turns out that Lanczos's choice to keep 

10 Or order reversal, not to be confused with Clifford algebra 'reversion.' 
11 Or 'order covariant.' 

12 In fact, the first correct treatment of non-relativistic spin is by P. Jordan, Zeits. f. Phys., 44 
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following Hamilton was the good one. The reason is that, when working with 
Hamilton's biquaternions, i is unique, always explicit, and independent of the 
rules of the algebra. 13 In a global field theory which encompasses classical and 
quantum theory, complex numbers and imaginary conjugation can then always be 
given a definite and consistent physical interpretation. 



5 Special relativity and quaternions 

In paragraphs 2, 3 and 4 of his first paper [1], Lanczos recalls the main features of 
the quaternion formulation of special relativity that he had already developed in his 
dissertation [5|. In this formulation, contrary to tensor or spinor calculus, there are 
no indices and all basic physical entities are represented by simple biquaternionic 
expressions. The variance 14 is then specified at the same time as each entity is 
defined. For example, the most basic tensor, the quadrivector, is defined by the 
following variance: 15 

V = CVC + (3) 
where £ is a biquaternion of norm unity, i.e., 

CC = C + C* = 1 (4) 

The fundamental Minkowski invariant is then simply 

VV = V'V = invariant scalar (5) 

which shows that £ represents a general Lorentz transformation. Since any nor- 
malized biquaternion can be resolved in a product C = 1ZB, where 72. = 72.* is real 

(1927) 473^-80. In this paper, Jordan uses quaternions explicitly. It was published a few weeks 
before Pauli's paper. To have a perspective on how somebody of the stature of Sommerfeld felt 
about Pauli's notation version quaternions, it is interesting to read his comment in A. Sommer- 
feld, "Atombau und Spektrallinien - Wellenmechanischer Erganzungsband" (F. Vieweg und Sohn, 
Braunschweig, 1929) p. 317. See also W. Franz, Ref. 1131 . 

13 In the Pauli and Dirac formalisms there are two different imaginary units: one in the respective 
0> or 7 M matrices, and one in the complex numbers multiplying them. This leads to special rules 
with regards to complex conjugation, and to mathematical difficulties which are often not fully 
appreciated. See, e.g., A.J. Macfarlane, J. Math. Phys. 3 (1962) 1 1 16-1 129, where it is shown that 
"the Pauli matrix four-vector does not by itself provide an adequate basis for the discussion of 
two-dimensional unimodular matrices." 

14 For brevity, we would like to characterize the transformation properties of a given geometric 
object by the French term 'variance.' 

15 In tensor calculus, this variance corresponds to the fundamental expression V' = L^V^ where 
could be represented by a 4 x 4 matrix. Just like in spinor formalism, quaternions allow this 
matrix to be broken down into two pieces, which in quaternions correspond to a pre-multiplying 
and a post-multiplying factors. 
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and B = B + is bireal, the general Lorentz transformation consists of the product 
of a rotation 1Z by a boost B. Multiplying quadrivectors, and alternating their 
variance by making use of quaternion conjugation, one obtains more complicated 
tensors. For example, taking two quadrivectors V\ and V2, the tensor 

F = V[V 2 (6) 

has a hexavector variance: 

F' = C*FC + (7) 

Because of ©, / = S[F], the scalar part of the hexavector F is an invariant. On 
the other hand, / = V[F], the vector part of F, has a hexavector variance. 

The interesting thing, which explains the power of the quaternionic tensor 
calculus, is that any time some 4-dimensional quantity is introduced with a given 
variance, the only possible different variance it may obtain is the result of operating 
with one of the three fundamental quaternionic involutions: (), ()*, or () + . 
This gives four possibilities that are analogous to the four basic variances of 
tensor/spinor calculus: contra- or co-variant, dotted or undotted indices. Hence, 
with quaternions, the conjugation symbols play the role of the position and type 
of the index in tensor calculus. The use of these symbols has the effect of keeping 
the variance of all quantities explicit, while at the same time keeping the notation 
compact. 

At this point, it becomes clear that the quadrivector or hexavector variances 
are not the most simple ones. For instance, one can have a spinor variance: 

F' = CF (8) 

where C could be replaced by C, C* or C + , or act on the other side, giving a 
total of eight possible spinor variances. It is immediately seen that the norm of 
such spinors is again a relativistic invariant, and that nothing prevents us of using 
spinors as fundamental physical quantities. Unfortunately, Lanczos did not fully 
realize this possibility. Throughout his three papers, he will insist on keeping only 
'quadrivector' or 'hexavector' variances as fundamental ones. 16 

A special type of Lorentz transformation which was not considered by Lanczos 
is that of spatial reversal. For this transformation, in the case of tensors constructed 
by multiplying quadrivectors, there is a simple connection with ordinal invariance. 
For example, 

V = ABC + CBA (9) 

16 The same was already done in his doctoral dissertation, where in chapter 4 he already noted 
the mathematical possibility of the spinor variance |59 Eq. (4.8)]. 
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where A, B and C are ordinal invariant quadrivectors, is invariant under ordinal 
conjugation. This implies that V cannot depend on the sign of q (see Eq. O). V 
is thus a polar quadrivector (odd parity). On the other hand, 

W = ABC - CBA (10) 

changes sign with ordinal conjugation. W is thus necessarily an axial quadrivector 
(even parity). Hence, for relativistic covariant entities of this type, symmetry or 
antisymmetry under ordinal conjugation is equivalent to being of odd or even 
parity under an improper Lorentz transformation. In the case of spinors, similar 
considerations can be made but the result is not a simple rule as with tensors 
constructed by multiplying quadrivectors. 



6 From Maxwell 's equation to Dirac-Lanczos 's equa- 
tion 

In quaternions, Weyl's equation in vacuum is simply 

VF(X) = (11) 

Here, V = [d it , dg] is the 4-gradient operator, assumed to transform as a quadrivec- 
tor, and F the field, a biquaternionic function of X, which reduces to a bivector 
(i.e., a complex 3-vector) in the case of Maxwell's equation. 

In paragraph 5 of his first paper JT], Lanczos compares equation (fTTT) for 
the cases in which it is interpreted as either Maxwell's or Weyl's equation. In the 
Maxwell case, F is the electromagnetic field bivector, E+iB. The variance is then 
necessarily that of a hexavector because the components of the electromagnetic 
bivector have to transform among themselves. In Weyl's case, however, there is 
no such restriction: the variance is not fully determined by © alone. In fact, 
Lanczos writes flU Eq. (32) p. 456] 

F' = C*FG (12) 

where G may be any biquaternion. In modern language, we would say that when 
F has a spinor variance, equation (fTTT) is invariant under a non-Abelian gauge 
transformation consisting of a right-multiplication by a normed biquaternion G. 
For Lanczos, however, this is seen as a defect, or at best as a peculiar property of 
the wave function that has still to be explained. 
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Lanczos's next step, in paragraph 6, is to consider the first order linear wave 
equation obtained by putting a mass term on the right-hand side of (fTTT) . Then, 
writing 

V£> = mD* (13) 

one gets an equation that is very similar to Dirac's, except for the imaginary 
conjugation on the right hand side which is essential for covariance. In particular, 
D being a biquaternion, it has four complex components, just like Dirac's bispinor. 
However, iterating the equation, since V* = —V, Lanczos finds that the Klein- 
Gordon equation, VV-D = —m 2 D, has the wrong sign. Thus, (IT3l corresponds 
to a particle with an imaginary mass. Because of the imaginary conjugation, this 
problem is not corrected by replacing m by im. In fact, what is missing is a 
quaternion factor on the right-hand side of D*. Taking for example, ie^ for such a 
factor, it is a matter of elementary algebra to verify that the equation 

VD = mD*ie 3 (14) 

is indeed strictly equivalent to Dirac's. This equation, to be called Dirac-Lanczos's 
equation [1 , Eq. (63) p. 462], will later be rediscovered by a number of people, 
in particular by Feza Giirsey 17 W3\ and David Hestenes 18 [20 1. In fact, in one 
form or another, it is the basic equation in any Clifford type formulation of Dirac's 
theory. In this equation, the usual arbitrariness in the choice of the 'Dirac matrices' 
corresponds to the freedom of taking any arbitrary unit vector as a factor on the 
right hand side. 19 

In the remaining part of paragraph 8, Lanczos calculates the bilinear covariants 
of Dirac's equation and some of their properties. In quaternion formalism, this is 
very easily done. One finds, 

1 complex invariant (scalar): I = E>[DD] (15) 

3 complex hexavectors (antisymmetric tensors): I n = V[De n D] (16) 

1 bireal quadrivector (current): J = DD + (17) 

3 bireal pseudo-quadrivectors (axial currents): J n = Die n D + (18) 



17 For a curriculum vitae of Feza Giirsey (1921-1992), and for a list of publications until 1984, 
see "Symmetries in Particle Physics", edited by I. Bars, A. Chodos and C.-H. Tse (Plenum Press, 
New York, 1985). 

18 David Hestenes (1933-) has been successful in popularizing the use of Clifford algebras in 
the context of special relativity, Maxwell, and Dirac theories. See, in particular, "Space-Time 
Algebra" (Gordon and Breach, New York, 1966, 1987, 1992). David Hestenes's father, Magnus 
Hestenes, was one of Lanczos's colleagues at the Institute of Numerical Analysis at the University 
of California at Los Angeles in the early 1950's. A photograph of David Hestenes's mother in 
company of Lanczos, circa 1951, is reproduced on page 3^463 ofRef. |58|. 

iy See the Appendix for a brief comparison of some of the remarkable differences between the 
Dirac and the Dirac-Lanczos equations. 
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These bilinear tensors have a simple interpretation. For instance, the existence of 
a magnetic moment associated with spin corresponds to the complex bivector I 3 . 
This is the antisymmetric electromagnetic polarization-magnetization momentum 
tensor. The four bireal quadrivectors form an orthogonal Frenet frame which 
moves along with the particle ll2T1l . a property that Lanczos associates with Ein- 
stein's concept of 'distant parallelism.' From equation (fl4l . i.e., Dirac-Lanczos's 
equation for an electron without an electromagnetic field, it follows that the par- 
ticle current J, as well as the axial currents J\ and J 2 , are conserved, while (in 
general) the spin density axial current J 3 is not conserved (it is conserved, e.g., 
for plane waves). According to Giirsey El l , it seems that J\ and J 2 will not be 
noticed until de Broglie will rediscover them in 1940. 

7 Lanczos 's fundamental equation and the isospin 
doubling of Dirac's equation 

As we have already said, Lanczos's main interest is not in reformulating Dirac's 
equation with quaternions, but in showing how to derive it from a more fundamental 
equation. His idea is to consider a system of two coupled equations, in which case 
the Klein-Gordon equation is automatically satisfied. This leads him in paragraph 
7 and 8 to postulate and study the properties of his fundamental equation JH 
Eq. (54) p. 459], which we write 

VA = mB VB = mA (19) 

Obviously, compared with Dirac's equation, this system has twice too many equa- 
tions and variables. But, as shown by Lanczos, it is in fact equivalent to two 
Dirac-type equations, independent of each other it provided A and B have a spinor 
variance, i.e., 

A' = CAG N B' = C*BG N (20) 

and Gn, an optional gauge transformation, is any element of a group which will 
be determined below. To obtain this result, Lanczos uses a rather long argument 
that can be summarized and somewhat generalized as follows: 

Given an arbitrary idempotent 20 biquaternion a, every biquaternion Z E B can 
be written as a linear combination 

Z = Z x a + Z 2 o (21) 

20 An idempotent a = a 2 has the property that aa = . For definitiveness, we take a = 
|(1 + iez), where e 3 is the third quaternion unit. 
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where Z\ G H and Z 2 eW. are two real quaternions. In spinor language, this gen- 
eral theorem simply means that if we interpret a biquaternion as a (4-component)- 
bispinor, it can always be decomposed into a pair of independent (2-component)- 
spinors. Hence, for Z equal to A or B, a can be used as a projection operator 
which enables one to define two independent linear superpositions, each of which 
is a new bispinor: 

D x = Aa + B*a (22) 

and 

D 2 = (Aa - B*o)ie x (23) 

These superpositions are Lorentz co variant because of d20l) . and the factor ie x , 
orthogonal to e 3 , has been introduced in order to agree with the modern isospin 
interpretation. It is then easily verified that both D x and D 2 are solutions of Dirac - 
Lanczos's equation (fT4"l) . The allowed gauge transformations G N are obviously 
those transformation which leave the superpositions (I22TI231 invariant. Since these 
transformations have also to leave the probability current J invariant, it follows 
from (fTTt and (l20t that they have to be unitary, i.e., such that G N G N + = 1. Hence, 
the allowed gauge transformations are elements of the group UN(1,C) which 
comprises all unitary quaternions that commute with a. Consequently, the only 
gauge invariant bilinear covariants are /, I 3 , J and J 3 , i.e., the bilinear covariants 
of the standard Dirac theory. 

The group UN(1,C) ~ U(l) <S> U(l), whose elements have the following 
explicit representation 

Gn = cr exp(— e*3«) + o 7 exp(— e*3/9) (24) 

will be rediscovered by Nijishima [22J in 1957. He will show that this is the largest 
group leaving the Dirac equation and the structure of quantum electrodynamics 
invariant. 

In the gamma-matrices formulation of Dirac 's theory, the interpretation of 
Nishijima's group is not obvious. But here it is trivial. Suppose we make the 
spinor decomposition of A and B according to (0T1) and multiply from the right 
by G N , the result is 

AG N = Ai exp(-e 3 a) + A 2 exp(-e 3 /3) (25') 

BGk = B x exp(-e 3 a) + B 2 exp(-e 3 /3) (25") 

Comparing with (|22l - i23l . we immediately see that while A and B transform 
with the same gauge G N , D\ and D 2 transform with exp(— e 3 a) and exp(+e 3/ 9), 
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respectively. In other words, if we interpret G N as a local gauge, D\ and D 2 
correspond to states with different couplings to the gauge field. 

To get closer to the modern isospin interpretation, we make the following 
gauge transformation: 

G N (jV,7) = crexp(-e 3 — - — 7) + <xexp(+e 3 — - — 7) 

= exp(iy7) exp(-e 3 ^7) (26) 

Hence, if 7 is the electromagnetic gauge field, and J\f the nucleon number, the 
electric charges of D\ and D 2 are respectively Qi = and Q 2 = ^jjp. This 
is nothing but Heisenberg's formula which gives the electric charge in function of 
the nucleon number and the projection of isospin: 

Q = y + *3 (27) 

This formula was proposed soon after the discovery of the neutron, and later was 
generalized to hypercharged particles by Gell-Mann and Nishijima. For A/" = 1, 
D\ is the proton ( t 3 = +| ) and D 2 the neutron ( t 3 = — |). 

The interpretation of D 1 and D 2 as an isotopic doublet was given for the first 
time by Giirsey lf23l in 1957. Giirsey also showed that the isospin raising/lowering 
operator is right multiplication of A and B by ie\. This interpretation, and the fact 
that the isospin raising/lowering is done by operating on A and B, confirms that 
the fundamental fields are Lanczos's fields A and B, and that the Dirac fields Di 
and D 2 axe derived ones. 

Lanczos, of course, could not have given this interpretation. He published his 
papers in volume 57 of Zeitschrift fiir Physik, while Weyl's seminal paper on the 
local gauge interpretation of electrodynamics had just appeared in the previous 
volume ll241 . Lanczos's interpretation was that the 'phase factor' was without 
consequence since it would disappear in what he called a "quantummechanical 
hermitian operation." In any case, Lanczos had the electron-proton doublet in 
his mind. Nevertheless, he was right to be convinced that the decomposition 
of his fundamental system (fT9"l) into two independent Dirac's equations was of 
fundamental physical importance. 
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8 Minimum coupling. Two-component electron the- 
ory. 



Now that we have found the correct interpretation of Lanczos's doubling, it is 
worthwhile to have a look at Lanczos's second paper [0 p. 476] in which he 
considers a particle in a non- vanishing electromagnetic field. As he did not use the 
local gauge invariance concept, Lanczos used the minimum coupling substitution 
instead, i.e., the replacement of V by V — ecp in (fT9l . where </> is the potential of 
the Maxwell field. He then made the superposition (1221 . once with the idempotent 
a — |(1 + ie 3 ), and a second time with the sign of ie 3 reversed. This gave the 
following equations 

VL>i - e0LMe 3 = +mD 1 *ie 3 (28) 

V/y + e^Dt'ies = -mD x '*ie 3 (29) 

Hence, for these superpositions, the two Dirac fields have electrical couplings 
of opposite signs, something that is explicitly remarked by Lanczos [2, p. 476]. 
But, as the right hand sides show that the masses are equal and of opposite signs, 
we see that the electron/proton interpretation that Lanczos was looking for is not 
consistent. 

In fact, if instead of Lanczos's second superposition leading to (l29l . we make 
the proper orthogonal isospin superposition d2"3l — after having made the mini- 
mum coupling substitution in (fT9"l) — we find that both Di and D 2 satisfy the wave 
equation (l28t and therefore have the same electric charge. This shows that the 
minimum coupling substitution is not sufficient to get the correct isospin interpre- 
tation of Lanczos's fundamental system. For this interpretation, it is necessary to 
make the gauge transformation (l2"6l) of the fields A and B. 

In the same paper, Lanczos calculates the second order equation for his funda- 
mental fields A and B: 

(VV - m 2 + e 2 H>)A = e(0V + V0)A (30) 

(VV-m 2 + e 2 #)5 = e(0V + V(f))B (31) 
For comparison, let us include the result for Dirac-Lanczos's field d2"%t as well 

(VV - m 2 + e 2 cf4>)D = e(0V + V4>)Die 3 (32) 
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In this equation, the biquaternion D is equivalent to a 4-component Dirac spinor. 
But, since a biquaternion can always be split in two parts according to the de- 
composition (|2lT) . (l32l can be replaced by a pair of equations of the form (l30l in 
which A is a singular quaternion, i.e., equivalent to a 2-component spinor ll25l . 
In this case, each of these singular second order equations is equivalent to one 
2-component spinor second order equation. 

Hence, when A is singular, (l30l) is equivalent to the relativistic Pauli equation 
(sometimes called the Kramer's equation |26|) which will be reinvented by Feyn- 
man and Gell-Mann ll2"71l in 1958 in order to justify the 2-component theory of 
weak interactions. In fact, with a proper interpretation, this second order equation 
is totally equivalent to the first order Dirac equation. 21 It can therefore be used to 
reformulated quantum electrodynamics in 2-component spinor formalism instead 
of the usual 4-component one ll28ll . The same applies to the singular quaternion 
version of d3*2l which leads to an elegant and compact quaternionic formulation 
of quantum electrodynamics [ 29 ] . It is therefore a great compliment to Lanczos, 
thirty years later, to read in Feynman and Gell-Mann's paper ll27l p. 194]: "... let 
us imagine that [Lanczos's second order equation] had been discovered first, and 
[Dirac's first order equation] only deduced from the later." 



9 Einstein-May er-Lanczos: proton/neutron and elec- 
tron/neutrino doublets in the same Lagrangian 

When he wrote his 1929 papers on Dirac's equation, Lanczos was with Einstein 
in Berlin. It is very plausible that Lanczos had some discussions with Einstein 
on his ideas about quaternions and Dirac's equation. The fact is that Einstein 
with Walter Mayer, his next collaborator after Lanczos's departure to the United 
States, will develop a new four-dimensional formalism: the semivectors QUI . This 
theory is rather similar to quaternions, except that the metric is hyperbolic instead 
of elliptic. 22 Because they originated from Einstein, they remained a parallel 
development to spinor calculus for a few years. 

In their first paper, submitted in 1932, Einstein and Mayer rewrote Lanczos's 
fundamental equation (fT9l . i.e., [30, Eq. (50) p. 539], as well the Dirac-Lanczos 
equation (fl4l . i.e., [30, Eq. (51) p. 540], in semivector form. However, in neither 
case did they credit Lanczos. Instead, after noting that the Dirac-Lanczos equation 

21 Already in 1933 in his new derivation of Dirac's equation |8 1, Lanczos showed that either a 
first order or second order formalism can be used to describe a spin 5 particle. 

22 The close proximity of semivectors to quaternions was pointed out by J. A. Schouten, Zeits. f. 
Phys. 84 (1933) 92-1 1 1, and others, e.g., (B). 
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could be derived from the fundamental equation by "specializing" the later in such 
a way that it contains just one four-dimensional field and its complex conjugate, 
Einstein and Mayer referred to the Dirac-Lanczos equation as a "Schrodinger 
equation." 

In 1933, Einstein and Mayer derived a spin | semivector field equation (in fact, 
a generalized form of Lanczos's fundamental equation) predicting that particles 
would come in doublets of different masses [31|. 23 Because of the similarity of 
semivectors to biquaternions, we can easily translate this model into our notation. 
The idea was that the most general Lagrangian for quaternionic fields, to be called 
the Einstein-May er-Lanczos (EML) Lagrangian, should have the form: 

L = A + ■ VAC + B + ■ VBC - (E + A + ■ B + B + ■ AE) + (...) + (33) 

where the C and E fields are invariant, except possibly for spatial reversal. The 
field equations are then 

VAC = BE + VBC = AE (34) 

which reduce to dT""t when C = 1 and E = m. For the modern interpretation, we 
can put C = I. 24 

Compared with Lanczos's fundamental system dT9l. we see at once that ("3*4"!) 
is no more symmetric between A and B: parity invariance is lost. In effect, since 
spatial reversal replaces V by V in Lanczos's equation, the parity conjugate field 
of A is B, and vice versa (in spinor language, A and B are opposite 'chirality' 
fields that are interchanged in a spatial reversal). This interchange is no more 
possible with the generalized system (13*41) when E + ^ E. 

Another major difference with Lanczos's fundamental system is that the second 
order equations for A and B are now eigenvalue equations for the mass. In effect, 
assuming C — 1 for simplicity, and eliminating the A or B fields in (I34|> . we find 
the generalized Klein-Gordon equations 

VVA = AEE + VVB = BE + E (35) 

where m 2 has been replaced by EE + or E + E. Assuming a plane wave solution 
of the form A = Q exp i(p-x — Et), we obtain the eigenvalue equation 

m 2 Q = QEE + (36) 

23 In fact, Einstein and Mayer actually considered the most general Lagrangian including the 
generalized Dirac-Lanczos field, the Maxwell field, and gravitation. 

24 In the language of particle physics, E and/or C correspond to some pseudo-scalar or scalar 
fields. For example, E can be interpreted as the Higgs field when C = 1. See, e.g., |62|. 
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and a similar equation for B. The important point is that although EE + ^ E + E 
in general, the two different values of m 2 given by the eigenvalue equations are the 
same for the A and B fields. It is therefore always possible to isolate the isospin 
eigenstates by making the isospin superpositions (|22l - l23t . There are two main 
classes of solutions. 25 

The first one is when E is real: E = E* = T. The field equations become 

VA = BT VB = AT (37) 

In this case, spatial reversal invariance is restored if T is assumed to transform into 
T, which implies that T can be interpreted as a pseudoscalar field. Now, because 
EE + = TT = m 2 is a scalar, there is no mass eigenvalue equation and the two 
isospin states are degenerate in mass. Because of that Einstein and Mayer will 
discard this case. This is very unfortunate, because if we make Lanczos's isospin 
decomposition (I2"2"l - l2"3l . we find that d3"7t corresponds to a doublet of equal masses 
spin | particles, and that these particles interact by means of the pseudoscalar field 
T. This interpretation will be first given by Giirsey ll32l . He will show, as far as we 
know completely independently of Lanczos or Einstein-Mayer, that equation (l37t 
describes a nucleonic field, non-locally coupled to a pseudoscalar eta-pion field. 
In other words, Giirsey,'s interpretation implies that (ITTb explains to first order the 
whole of low-energy nuclear physics: protons, neutrons, charge independent pion 
theory of nuclear forces, etc. 

The second class is when E is bireal: E = E + . There is now an eigenvalue 
equation because, in general, EE + = E 2 is not a scalar anymore. Then, there are 
always two solutions of different masses. Einstein and Mayer studied this case 
in detail because they were expecting to find an explanation for the existence of 
the electron and the proton. Of course, today we know that we cannot expect to 
find these particles in the same multiplet because the electron is a lepton, while 
the proton is made out of quarks. Nevertheless, it is interesting to note that when 
C / 1, the Lagrangian d33l) has solutions which correspond to pairs of particles 
with different masses and electric charges of opposite signs. 

In fact, it is possible to recover Einstein-Mayer's result in the case (7=1, 
provided one notices the existence of two basic conserved currents: the probability 
current J, and the barycharge current K 

J = AA + + ~BB+ (38) 



K = AEA + + BEB+ (39) 

25 The first systematic study of the mass spectrum of the Lanczos-Einstein-Mayer equation J34l > 
was made, at the request of W. Pauli, by V. Bargmann, Helv. Phys. Acta 7 (1934) 57-82. 
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While J has the usual interpretation, K divided by the mass corresponds to the 
electromagnetic current as defined by Einstein-Mayer. Then, when EE > the 
two solutions have equal signs for the charge, and when EE < the signs are 
opposite, as in Einstein-Mayer's case. However, for the modern interpretation, the 
most interesting solutions are obtained in the singular case where EE = 0, i.e., 
when E is an idempotent. One solution is then massive (with K = J), and the 
other one massless (with K = 0): an electron-neutrino doublet. 

The proton-neutron and the electron-neutrino solutions have the common fea- 
ture that in both cases the E field is also a global gauge field. This is very 
important because it provides a clear distinction with the other solutions of the 
EML Lagrangian. Hence, in summary, by simply writing down the most trivial 
quaternionic generalization of the Lagrangian giving the Lanczos's fundamental 
equation (fT9l) . we obtain a theory which predicts the existence of the four most 
important particles of matter: the proton, the neutron, the electron and the neutrino. 

10 Einstein-Mayer-Lanczos: towards a unified pic- 
ture of matter and fundamental interactions ? 

In the contemporary theory of elementary particles, a fundamental concept is the 
idea that all interactions are mediated by local gauge fields. In the Standard 
Model, the gauge fields of the unified electro-weak interaction are operating in an 
abstract space which corresponds to the transition-current algebra of perturbation 
theory. Therefore, the SU(2) (g> U(l) symmetry of this model concerns a kind 
of superstructure which is built upon a field theory in which the Dirac equation 
is the most fundamental equation of matter. In such a perspective, there is no 
explanation for the origin and nature of the gauge fields, or for the pairing of the 
leptons and quarks into isospin doublets. The whole picture can be described as a 
very successful and effective phenomenology. 

If instead we assume that the most fundamental equation is not Dirac 's but 
Lanczos's equation (fl9t . we start with a system that explains isospin in a natural 
way. Moreover, if we include the generalization of Einstein and Mayer, we have 
a possibility of understanding the origin and nature of the gauge fields [33 1. In 
effect, as we have already explained, and as was shown for the first time by Giirsey 
in the case of pion-nucleon interactions [32|, the 'internal' symmetries are explicit 
and trivial in Lanczos's equation, while only space-time symmetries are explicit in 
Dirac's equation. In other words, in this perspective the Lanczos fields A and B, 
and the gauge fields allowed by the generalized Lanczos's equation, provide a kind 
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of infrastructure from which the physical fields and their interactions are derived. 
If this picture is correct, there is no abstract 'isospin space' or 'current-algebra 
space,' but a unified field theory in which everything is defined in space-time. 

Consider first the case of electromagnetic interactions. As shown in Sec. 7, the 
electromagnetic gauge field is combined with the nucleonic or leptonic number 
gauge field in the Nishijima group. This is the largest group compatible with the 
Lanczos-Giirsey isospin superpositions (I221 - I231) . From this group we understand 
electromagnetic interactions, the (+1, 0) and (0, —1) charges of the nucleons and 
neutrino/electron pairs, as well as the possibility of (+|, — ~) charges for the quark 
states. 

The second kind of interactions which concern both the nucleons and the 
leptons are the weak interactions. These parity violating interactions correspond 
to a non-Abelian gauge transformation which is made possible by the E field in 
the EML equation (l34l) . In effect, in the absence of E, all gauge fields have to act 
symmetrically on the A and B fields, and thus conserve parity. On the other hand, 
when there is an E field, gauge transformations such that A — > AG, B — > B and 
E — > G + E are feasible. The corresponding gauge field has an SU(2) symmetry, 
and the resulting phenomenology is equivalent to that of the Standard Model 

For the leptons, the weak interaction is the only operating non-Abelian field. 
This limitation is due to the fact that E is singular. 

For the nucleons, E = E* = T is a regular quaternionic field. One can 
thus perform isotopic rotations on the A and B fields independently. The gauge 
symmetry is thus the chiral SU(2) (g) SU(2) symmetry of low-energy strong 
interactions. Therefore, as we have explained in Sec. 10, the T field leads to the 
strong interactions in the form of the well-known charge-independent pion-nucleon 
theory. Of course, as far as a general theory of strong interactions is concerned, 
this theory has long been superseded by 'quantum chromodynamics.' One could 
therefore object that such a primitive formulation would be a return to the 1960's 
and to the naive 'sigma model' of Giirsey and Gell-Mann-Levy [32|. Indeed, that 
is the case, and provided one takes into account all the subsequent progress that 
has been made in the interpretation of this theory, there is no contradiction with 
the high energy limit where 'quarks,' 'confinement,' and 'asymptotic freedom' 
become the key concepts. The contemporary problem is precisely to find a new 
theory which would encompass both of these low and high energy limits ll35l . 

To conclude, we note that the Einstein-Mayer generalization of Lanczos's 
equation is obtained by the substitution Am — > AE in the EML Lagrangian. 
Since this substitution is not symmetrical between A and B, the lifting of the 
mass degeneracy of Lanczos's original equation (IT9b and the possibility of weak 
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interactions, are both linked to a maximally parity violating field. 



11 Lanczos's discovery of Proca's equation 

In the last and longest section of his first 1929 paper Q], Lanczos starts his struggle 
in order to give a normal tensor interpretation to his fundamental equation. His 
motivation is clearly that since he has been successful in deriving Dirac's equation 
from it, the fundamental equation itself — once properly interpreted — should say 
something important about the nature of the electron and in particular the origin of 
its mass. It is at this point that the difficulties we already mentioned (with physical 
interpretation, spatial reversal, and 'ordinal' covariance) become important. 

In the case of Lanczos's fundamental equation (fT9l) . a proper covariant inter- 
pretation requires the definition of a non-ambiguous variance for the A and B 
fields, and (at least) the construction of a conserved and ordinal invariant particle 
density probability current. For the first requirement, Lanczos easily finds that 
there is only one simple possibility (provided that space reflections are not taken 
into account). One field has to be a quadrivector, and the other a hexavector: 

A' = CAC + B' = C*BC + (40) 

Under these conditions, (fl9l) is no more symmetrical between A and B, and A can 
be called the 'potential' and B the 'field.' 

The particle current must be a bireal quadrivector, and Lanczos easily identifies 
a candidate expression, A + B + B + A, which, unfortunately, is not conserved: 

V • (A + B + B + A) ^ (41) 

Lanczos then enters into a long discussion, which he will continue in the 
following papers. He will get very close to the root of the problem and, mostly 
following his intuition, he will discover what he was looking for: a fully covariant 
relativistic equation with a traditional tensor interpretation. 

Alexandre Proca will derive his equation in 1936 [36 1. But, most extraordinary, 
Proca's equation (in tensor form) is already explicitly written down in Lanczos's 
first paper HI Eqs. (98A,B), p. 472] and will be repeated in the following ones 01 
p. 49 1 ] and p. 1 24] . Believing that he had found a new equation for the electron, 
Lanczos had in fact discovered Proca's equation. Strangely enough, Proca will 
make the same mistake: only in 1938, after taking the non-relativistic limit, he 
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will discover that his equation does not describe an electron, but a massive spin 1 
particle [37]. 26 

To obtain this remarkable result, Lanczos starts by converting his quaternion 
system dT9l into tensor form. He then removes all the terms which are linked with 
'magnetic currents' and finds that the resulting system has now a conserved current. 
In this process, Lanczos twice uses his physical intuition of a fundamentally four- 
dimensional world. First, he discards the scalar part of B, because Lorentz 
invariants (such as mass or charge) should have a clear physical interpretation 
(such an interpretation for §[£>] is here not readily available). Second, he discards 
the 'magnetic current,' because the magnetic/electric 'duality' is essentially a 
three-dimensional concept. Finally, he uses one more heuristic guide, the idea that 
the resulting system should have the same number of equations as of unknowns, 
so that, just like Dirac's equation, the retro-coupled system is homogeneous, and 
therefore suitable for having 'eigensolutions' of the kind that are important in 
quantum theory. 

To conclude, it is of interest to compare Lanczos's derivation of the field 
equation of a massive spin one particle with that of Proca [36|. In effect, it is 
clear that Proca's derivation, which is based on the Lagrangian formalism, is much 
better than Lanczos's intuitive and laborious one. But, apart from that, the main 
characteristics of Proca's equation, including its current and energy-momentum 
tensor, are already contained in Lanczos's papers of 1929. The interesting point, 
which makes us wonder about the influence Lanczos's work might have had on 
Proca, is that Proca published in 1932 a paper ll3lfll in which he refers to all three 
of Lanczos's articles of 1929, as well as to Lanczos's dissertation of 1919. In 
this article Proca uses biquaternions and tries to explain the electron-proton mass 
difference by means of a modified Maxwell's equation. Actually, Proca uses 
Lanczos's fundamental equation (fT91 with m — 0, and twice refers to it under 
the name of "Lanczos's equation." It is therefore not unreasonable that what is 
now known as the "Proca equation" for a massive spin 1 particle should in fact be 
called the "Lanczos-Proca equation." 

26 In this paper, page 63, Proca qualifies the quadrivector variance £( )C + as 'quaternionic,' and 
the spinor variance £( ) as 'spinorial.' 



24 



12 Quaternionic derivation of the spin and spin 1 
wave equations from Lanczos's fundamental equa- 
tion 

The successful derivation of the spin | Dirac-Lanczos equation (TBb came from the 
implicit assumption that Lanczos's double equation fHJt was the fundamental one 
and that the A and B fields had to be treated on an equal footing. This meant that A 
and B were interpreted as components of a 'bispinor' and that the 'Dirac field' was 
obtained by making an appropriate covariant superposition. Hence, in the same 
spirit, one should not try to give a physical interpretation to (IT^l directly as such, 
but assume other possible meaningful relativistic variances, and then construct the 
physical fields by making some suitable superpositions. 

The derive Proca's equation, the crucial point is that while Lanczos correctly 
guessed that for a proper Lorentz transformation the variances assigned in (BUb 
were adequate, he did not realize that he had to make further assumptions in order 
to define physical fields with a correct variance under spatial reversal as well. 

For example, if we assume that A is ordinal symmetric, i.e., that A = A~ is 
an odd parity quadrivector, A may indeed be a physically meaningful field. On 
the other hand, B = V 'A is neither ordinal symmetric nor antisymmetric, and 
therefore has no definite parity. Leaving this problem aside temporarily, we can 
nevertheless use B as an intermediate field in order to construct a fully covariant 
equation for A. To do that, let us rewrite Lanczos's equation (TH?t together with 
its ordinal-conjugated one, and at the same substitute B for mB to agree with the 
standard notation for integer spin fields: 

VA = B VB = m 2 A (42) 

A~V = B~ = m 2 A~ (43) 

Then, since A = A~, the superposition which leads to an ordinal symmetric 
potential is obtained by adding the two equations on the right of d4"2"l) and (14*3*1) . We 
get the system 

VA = B VB + B~V = 2m 2 A (44) 

which is now a fully covariant equation for a polar quadrivector field A. But it 
is still not entierly correct. The reason, now that we have taken care of parity, 
is that the fields in (|44h are in fact mixtures of spin and spin 1 states. This 
is because the variance (l40b is such that the scalar part of B is an invariant 
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while the vector part transforms as a hexavector. Hence, the field B = [b,b] 
can be covariantly decomposed into its scalar and vector parts. Introducing the 
corresponding decomposition of the potential A = Aq + A\, the system d44l can 
be split into a spin and a spin 1 part. 

For the spin zero part, because A = A" = A , we see from d4"2"l - l4"3l that 
b = 6~. The system (1441) reduces therefore to the following degenerate form of 
Lanczos's equation 

V • A = b X7b = m 2 A (45) 

For the spin one part, the resulting equation is just (14*41) with the field B 
restricted to its vector part b. This gives the biquaternion form of the Lanczos- 
Proca's equation for a massive spin 1 particle [1 , Eq. (98)], [36, Eq. (21)]: 

V A A x = b Vb + 6~V = 2m 2 A x (46) 

With this equation it is trivial to verify that the correct conserved spin 1 particle 
current is now 

J = A 1 +b + b+A 1 + (...)~ (47) 

The parentheses mean that the full expression is obtained by adding the ordinal 
conjugated of the first part, which has the effect of making the total ordinal invariant 
and divergence-less, so that Lanczos's non-zero result d4*TT) is corrected. 

The key assumption in going from Lanczos's fundamental equation (fT9l to 
(l45l) and (14*61) . i.e., the field equations for a massive 'scalar' or 'vector' particle, 
was that A had to be an ordinal symmetric vector. If we had assumed the second 
fully covariant possibility, i.e., that A is ordinal antisymmetric, we would have got 
the field equations for a pseudo-scalar and respectively a pseudo-vector particle. 
The only difference is that in the second equation of d46l) . the plus sign has to be 
replaced by a minus sign. The fact that all equations for particles of spin and 1 
are degenerate cases of Lanczos's equation has first been shown by Gursey in his 
PhD thesis HH p. 162]. 

Before concluding, two comments are in order. First, to get equation (l4"4l we 
have added the two equations on the right of ("4*2"l) and (l"""""l) . But, to be complete, 
we have also to take into account the equation obtained by subtracting them: 

X7B - B~X7 = (48) 

This equation corresponds to the requirement that there should be no magnetic 
poles or currents. However, it can easily be verified that (l48t is redundant and that 
(l46b is sufficient to eliminate the magnetic contributions. The second comment 



26 



is that since we have Proca' equation (HoT) . we can now identify the quaternion 
representation of the spin 1 field anti- symmetric tensor. It is 

2F() = ()6 + 6~() (49) 

Contrary to the bivector b, this tensor is ordinal symmetric, and therefore defines 
a physical field of odd parity. 

To summarize, we have found that, apart from its decomposition into a pair of 
independent spin ~ fields, Lanczos's fundamental equation also had two possible 
decompositions into a spin and a spin 1 field, one for scalar and vector particles 
and the other for pseudo-scalar and pseudo-vector particles. 



13 Maxwell's equation and the absence of magnetic 
monopoles 

As we have said in the introduction, Lanczos's motivation for introducing his 
fundamental equation was the idea that it was in fact Maxwell's with feedback. 
Now that we have derived Proca' s equation d46T) . we can set m 2 = to get the 
quaternionic field equation of a mass-less particle of spin 1. Including a source 
term with a current J we obtain: 

VAA = b Vb + b~V = -8irJ (50) 

This is the correct form of Maxwell's equation in quaternions. It explicitly implies 
that the source current, and thus the potential, has to be ordinal symmetric and 
shows that the electromagnetic field is an antisymmetric tensor. The correct 
quaternionic form of this tensor d49l was first given by Kilmister in 1955 ll39l . 

The usual form of Maxwell's equation iRUl |5) contains the electromagnetic 
bivector b instead of the electromagnetic tensor (l49l : 

VAA = b V6=-4ttJ (51) 

This equation is formally incorrect because it is not covariant under spatial reversal. 
However, because of d4~51) . it is in practice equivalent to (l50b . provided that A and 
J are postulated to be ordinal invariant. 

Equations (BoTBTT) are valid for charged as well as for neutral massless particles. 
To identify the neutral fields, we observe that, in both Proca's or Maxwell's case, 
whenever the potential is bireal the current d47t is zero. Electrical neutrality is 
thus the condition A = A + , which in turns implies J = J + . 
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A very important consequence of our derivation of Maxwell's equation from 
Lanczos's fundamental system dT9l is that it implies, by (|4*%1) . the non-existence 
of magnetic monopoles for odd parity vector fields. 27 This statement is valid 
even though we have not required A or J to be bireal. Hence, the non-existence 
of magnetic monopoles is independent of the so-called 'Rainich-' or 'duality- 
transformation' [41 1, which is simply the statement that (1501 - 1511) are invariant 
in a transformation which amounts to multiplying all terms by the same complex 
number. 28 In other words, for Maxwell's field, it is always possible to take A = A + 
and J = J + . 



14 The energy-momentum tensor and the stability 
of matter 

There is a clear difference in style between Lanczos's first papers on Dirac's 
equation [1J and the whole series which followed and culminated in the last paper 
he wrote in Europe before leaving for the United States 0. 

The first paper can be seen as a typical textbook like expose, perfectly orga- 
nized in consecutive sections, were everything fits together, except for the last 
section in which the quaternionic formalism is abandoned and various heuristic 
arguments are used to derive Proca's equation. From there on, the style becomes 
less didactic, and Lanczos starts presenting more and more speculative ideas about 
the unsolved problems of matter. At the same time as he abandons the purely 
deductive presentation of his first paper, he becomes more self-assured, and goes 
on to take real risks, such in the last sentences of his third quaternionic paper, 
where he prophesies the "end of quantum theory as a self-contained discipline", 
and the emergence of an essentially nonlinear "theory of matter" £2 P- 493]. 

When reading the text of an invited paper he gave at the October 25, 1929, 
meeting of the Physical Society of Berlin [4], it is not difficult to imagine the 
feelings of those listeners who primarily came to learn more about the relativistic 
electron theory which finally succeeded in explaining the atom: Who is that man, 

27 In recent years, the consequences of the null results in the search for magnetic monopoles, 
and in particular the implications of an essentially asymmetric form of Maxwell's equations, has 
been most thoroughly analyzed by E.J. Post. See, for example, E.J. Post, Phys. Rev. D9 (1974) 
3379-3386. 

28 For an analysis of the 'monopole question' using 3-dimensional vector analysis, see for 
example, J.D. Jackson, Classical Electrodynamics, (Wiley, New York, 1975) 251-253. See also, 
J. Schwinger, Science 165 (1969) 757-761; and in quaternion formalism F. Giirsey, Rev. Fac. Sci. 
Istanbul A19 (1954) 154-160. 
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who not long after writing down Dirac's equation, objects that, in comparison with 
Maxwell's theory, Dirac's theory is not a true field theory, and goes on to devote 
more than two third of his time presenting a very ambitious and highly speculative 
nonlinear theory of matter? 

Clearly, for such self-confidence, the derivation from the same fundamental 
system, of both Dirac's and Proca's equations, must not have been enough. Sim- 
ilarly, the lack of interest shown by the leading quantum physicists of the time, 
several of them between 5 to 10 years younger than he, cannot explain why Lanc- 
zos took such a critical standpoint as to qualify quantum mechanics as a "more 
rigorous and therefore more intolerant form of positivism" p. 104]. Having 
worked with Einstein, now on his way to the United States, why did Lanczos 
feel the need to write such an extraordinary testimony as his thirty-five page re- 
view paper of 1931 [6|? Indeed, this paper discussed the fundamental theories of 
physics and their problems — and important related philosophical issues — and it 
also discussed Einstein's new field theory which included the concept of 'distant 
parallelism:' 

In our opinion, Lanczos was driven by his genius. Exactly what he will later 
say of Hamilton could be applied to himself: "The genius is not free to act because 
he is relentlessly driven by his 'daimonion,' as Socrates expressed it" (42, p. 140]. 

In Lanczos's case, his "demon" was his conviction that the fundamental equa- 
tions of matter had to contain some kind of a feedback mechanism in order to 
insure stability, that in their complete form these equations had to be nonlinear so 
that everything is fully constrained, and that the singular central region had to be 
explained by the same equations. 

Let us consider the question of feedback. As we already know, Lanczos was 
very impressed by his fundamental equation. It was Maxwell's with feedback and 
it had all the properties of a good field equation. In particular, he found that it 
had a conserved current and, moreover, a conserved energy-momentum tensor. In 
quaternions, this tensor has the following simple form 

8ttT() =b + ()b + m 2 A + ()A+ (...)~ (52) 

Here, as in the current (14*71 . making the expression ordinal symmetric has the 
effect of eliminating the 'magnetic' terms and making the corresponding tensor 
symmetric. Hence, Lanczos had done more than just put a mass term on the 
right hand side of Maxwell's equation: he had found essentially all the important 
properties that Proca will rediscover in 1936. 

To understand Lanczos's interest in Proca's equation and its energy-momentum 
tensor, we must remember that in his dissertation he proposed a field theory of 
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matter in which electrons were singularities in the Maxwell field 0. The first term 
in (l52l is nothing but the so-called Maxwell tensor, i.e., the energy-momentum 
tensor of the Maxwell field, 29 so that the second term is the correction arising from 
the feedback. Since this second term has the form of a pressure term, Lanczos's 
conclusion was that the vanishing of the divergence of d52l corresponds to the 
stabilization of the electron's field energy-momentum distribution. 

In effect, in a field theory of matter like Lanczos's electrodynamics, there is a 
kind of duality principle such that the physical properties of the electron can be 
attributed either to the singularity (the particle aspect) or to the field (the wave 
aspect). Thus, although it is not explicitly stated in any of Lanczos's papers, his 
idea was that his new equation had to describe the electromagnetic field of the 
electron, while Dirac's equation was only to describe the motion of the singularity. 

Such an interpretation is perfectly sensible. We all know of the infinities in 
classical and quantum electrodynamics that, in both cases, are somehow due to the 
unstable nature of the electron. In that respect, because of the coupling of Dirac's 
to Maxwell's equations, the success of quantum electrodynamics can be seen as 
the result of a cancellation between the non-zero divergence of Maxwell' tensor 
for the electromagnetic field and Tetrode's tensor for the Dirac field: 

SvrTQ = [(()■ V^D)ie 3 D+ - Die 3 [D+(v- ())] - e(0 • ())dD+ (53) 

In fact, in a truly consistent theory, both tensors should be divergenceless. Of 
course, starting with the introduction of stresses of an unknown nature by Poincare, 
there have been many attempts to modify Maxwell's tensor to produce stability. 
In particular, the recent attempt by Schwinger H51 is not without resembling 
Lanczos's proposal to replace Maxwell's tensor with Proca's. 

It is in his third paper that Lanczos expounds his idea: "Instead of the Dirac 
current we have the energy-momentum tensor, instead of current conservation, 
we have energy-momentum conservation" 01 p. 487]. However, when putting 
numbers in, it turns out that if m in (l46l) and d52l is the mass of the electron, 
everything comes out wrong: the mass of the photon is no more zero but equal to 
m ! Hence, Lanczos makes his next bold step: m is in fact not the electron mass, 
but a "field function" 01 p. 492] which is everywhere practically zero, except in 
the immediate neighborhood of the singularity. In this perspective, he suggests a 
new interpretation of wave mechanics: "the de Broglie-Schrodinger wave equation 
with constant m cannot characterize a single electron but only a large 'swarm' of 
electrons" 01 P- 493]. 

29 The quaternionic form of Maxwell's tensor was first given by Silberstein, Phil. Mag. 25 (1913) 
135-144. The remarkable thing is that if this tensor would have been adopted right away, many 
problems due to the improper use of the non-covariant Poynting vector could have been avoided. 
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More ideas and details will be given by Lanczos in his Berlin conference [4|. 
Summarizing his results and once again writing down Proca's equation, he insists 
that the mass term should in fact be a field function in order that the equation 
can be applied to the electron. To make this nonlinearity plausible, he rewrites 
Proca's equation in general tensor form and then, assuming a conformally invariant 
geometry, he explicitly shows that the mass term indeed becomes a function of 
space-time. 

More than sixty years after Lanczos suggested these ideas, we are still very 
far from a working nonlinear relativistic quantum theory. We may even say that 
Lanczos's ideas were just whishful thinking. But does it mean that Lanczos's 
general ideas about how to modify quantum theory were wrong? Not at all. And, 
in our opinion, the reason is that if we believe that at some level the laws of nature 
must be simple, there are only a very few ways in which we can modify the existing 
laws of relativistic quantum theory in order to reach a more complete theory. 

15 The principles of a nonlinear field theory of mat- 
ter 

When Lanczos left Europe for the United States, he was in his mid-thirties and 
possibly was one of the few physicists of the time with a really wide perspective 
over the whole of fundamental theoretical physics. He had worked in classical 
electrodynamics, quantum theory, atomic physics, and general relativity, and knew 
as much about the successes as the problems of these theories. In contrast to 
Dirac, who in 1930 wrote and published "The Principles of Quantum Mechanics" 

— which tend to imply that quantum theory is a theory essentially in its final form 

— Lanczos would summarize his doubts and certainties in part I of his long paper 
dealing with fundamental theories and related philosophical issues [6|. 

According to Lanczos, what are the truly fundamental principles of a physical 
theory of matter? 

Lanczos's starting point is a comparison of Einstein's general relativity to 
classical electrodynamics. In the former we have the prototype of a closed and 
predictive theory were everything is determined by the field equations: the geom- 
etry of space-time, the conservation laws, the motion of the masses. In classical 
electrodynamics, we have a satisfactory description of the electrical phenomena 
in the macroscopic domain, but a theory which fails at the microscopic level as 
soon as one meets with the problem of singularities. 
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As is well known from function theory, the behavior in the neighborhood of 
the singularities defines the properties of the whole function everywhere. It is 
therefore natural to consider the singularities as the 'source' of the function, and 
thus to see the matter/field duality as the physical counterpart of the mathematical 
duality of singular versus regular points of an analytic function. 

However, at the same time as the function becomes singular at a given point, 
it becomes undefined and as a result one has an uncertainty: the solution is by 
no means unique. In the most simple cases, such as in modern quantum electro- 
dynamics, one can circumvent this difficulty by the process of 'renormalization.' 
But universal properties such as the definite mass of electrons or protons, or the 
uniqueness of their electrical charge, cannot be defined by such solutions. 

Fortunately, under favorable conditions, differential equations and systems may 
have solutions which are everywhere regular, without being trivial. Such solutions 
are called 'eigensolutions' and play a fundamental role in wave mechanics. If 
the equations are linear, a constant factor is left undetermined, but if they are 
nonlinear everything is fully constrained. This is the case with general relativity, 
and Lanczos suggests therefore that all fundamental properties of matter should 
be determined by eigensolutions. The position of quantum mechanics relative to 
fundamental field theory is therefore similar to that of Maxwell's theory relative to 
the classical theory of the electron: "Wave mechanics does not solve the problem 
of matter, it just gives averages over space or time" p. 112]. 

In this perspective, Lanczos's hope is that a future fundamental theory will 
solve the problem of singularities at the same time as the problem of matter 
and electricity. To succeed in such a task, he puts forward a two-step program: 
"Generally covariant field equations giving to space-time a definite geometrical 
structure. Eigensolutions to these equations, giving on the one hand the electron, 
on the other hand the proton" [6 , p. 112]. 

Sixty years later, were do we stand? All attempts to incorporate electricity 
into a generalized form of general relativity have failed. And the number of 
'elementary particles' has vastly increased, from two in 1930, to several tens or 
hundreds, depending on whether the excited states are counted along with the 
fundamental ones. 

A key element seems to be missing. Here again we may refer to Lanczos's 193 1 
paper. Just after giving his 'program,' and before starting his analysis of Einstein's 
new theory of distant parallelism, he recalls the question of the universal constants: 
Should the fundamental equation already contain the universal constants of nature 
or not? Since the velocity of light allows the measurement of time in units 
of length, and since general relativity has reduced gravitational interaction to a 
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question of curvature, what is the proper unit of length for a theory of elementary 
particles? If we exclude the singular solutions in favor of the regular ones, the 
eigensolutions, what are then the meanings, if any, of the Planck length and of the 
classical electron radius? 

The missing element is ^fundamental length. If it is the Planck length, how can 
we explain the gigantic difference of this length relative to the electron radius? If 
it is the electron radius, which has about the correct size for a theory of elementary 
particles, why not take a length obtained by multiplying it by a dimension-less 
constant such as a = e 2 /he ~ 1/137 ? Moreover, what physical interpretation 
are we going to give to this length? 

Using all our contemporary knowledge, let us try to resume Lanczos's program. 
Let us do it within the framework of special relativity, for the present leaving aside 
the question of properly dealing with gravitation. What we are looking for is some 
fundamental equation which could give us an explanation for the nature and basic 
properties, such as the mass, of elementary particles. 

As a starting point, we observe that Lanczos's fundamental equation is ex- 
tremely successful because we can derive Dirac's as well as Proca's equation from 
it. Moreover, Lanczos's equation contains isospin, and thus an explanation of the 
existence of doublets of particles which are either electrically charged or neutral. 

As a second step, we consider Einstein-Mayer's generalization as a convincing 
hint that assuming the mass term in Lanczos's equation to be a biquaternionic 
parameter, we can lift the mass degeneracy of Lanczos's equation, and thus explain 
the existence of electron and neutrinos, and at the same time explain the existence 
of strongly interacting nucleon pairs. In this last case, the E field introduced 
by Einstein and Mayer is not just a parameter but the pseudoscalar field which 
corresponds to the pions. 

The third step is to systematically study the possible generalization of the 
Lanczos-Einstein-Mayer system in order to discover other meaningful physical 
interpretations. In doing this generalization, it is worthwhile using, as Lanczos 
did, two heuristic guides. First, Lanczos's feedback idea, i.e., the concept that 
fundamental equations should be closed systems. Second, also following Lanczos, 
Hamilton's conjecture, i.e., the idea that somehow biquaternions are an ideal 
formalism for writing the fundamental equations of physics ifTTll . 
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16 Petiau's nonlinear wave mechanics and Barut's 
mass formula 



In 1965, about the time when Gell-Mann and Zweig proposed the concept of 
quarks, in complete independence from mainstream research and without knowing 
that he was in the path of Lanczos, the French physicist Gerard Petiau 30 started 
studying systems of equations which may precisely give a solution to the problem 
of the mass of leptons and quarks 1441 l45ll . In more than fifteen papers, mostly 
published between 1957 and 1969, Petiau analyzed many different aspects of 
nonlinear wave mechanics, looked at regular as well as singular solutions, and 
considered the problem of how to extend the canonical formalism, the question of 
second quantization, etc. All of that he will do mostly following the instinct of a 
physicist, with everything very neatly written down and explained starting from 
first principles, in the pure French tradition of mathematics. 

Although Petiau was thinking in very general terms, considering complicated 
couplings between particles of various intrinsic spin, it is easy with quaternions 
to write his fundamental equation in the case of spin | particles. It essentially 
amounts, in the spirit of Lanczos's feedback idea, to adding a third equation to 
Einstein's system (B4"l) in order to close it: 

VA = BE 

VB = AE (54) 
VE = AB 

Here A and B are the fundamental Lanczos fields, and E = E + an additional 
Einstein-Mayer field of spin 0. In the most simple case, E is a real scalar function 
of space-time and can be seen as replacing the mass in Lanczos's fundamental 
equation (fT9"l). 31 As usual, to get the physical spin ~ eigenstates, we have just to 
make the Lanczos-Giirsey isospin superposition 

Because d54l is a closed nonlinear system, the solutions are much more con- 
strained than with the usual linear type of wave equations. For instance, the 

30 Gerard Petiau (1911-1990) was one of the first students of Louis de Broglie. He published 
many papers on waves equations of all possible spin. He is best known for his contribution to the 
Kemmer-Duffin-Petiau formalism of spin 1 particles. He worked out many of the mathematical 
details of de Broglie's 'double solution' theory of causal nonlinear quantum mechanics. His major 
discovery is the elliptic function generalization of wave functions, which continuously interpolate 
between de Broglie's and solitonic waves. 

31 The function E 2 corresponds to the 'metric factor' a introduced by Lanczos in his general- 
ization of the Proca equation for conformally flat space-times p. 127]. 
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single-periodic de Broglie waves that quantum mechanics associates with a par- 
ticle become double-periodic Petiau waves Hoi . Instead of being linear combi- 
nations of sin(z) and cos(z) functions, these waves are superpositions of elliptic 
functions sn(z, k), ca(z, k), etc. A very appealing feature of Petiau waves is that 
their dependence on the modulus k interpolates between pure de Broglie waves 
(for k = 0) and pure solitonic waves (for k = 1): a beautiful realization of the 
wave/particle duality of quantum mechanics. Moreover, both the amplitudes and 
the proper frequency (the /i factor appearing in the argument of sn[fj,(Et — p-x) , k) , 
for example) will be quantized. This is exactly the kind of behavior Lanczos was 
expecting. 

The most interesting thing, however, happens when, in order to quantize the 
system, the Hamiltonian function is constructed. Taking, for example, A as the 
fundamental field, Petiau P4l showed that, in terms of the first integrals, the 
Hamiltonian will have a very simple form 

H = H k 2 ^ (55) 

where k is the modulus of the elliptic function, [i the proper frequency, and H 
some constant. The exciting thing is that the Hamiltonian, and thus the total energy 
in the field (i.e., for a single particle, the effective mass) scales with the fourth 
power of [i. 

In effect, in 1979, Barut lH71 discovered a very good empirical formula for 
the mass of the leptons. Assuming that a quantized self-energy of magnitude 
|a _1 m e A^ 4 , where N = 0, 1, 2..., is a new quantum number, be added to the 
rest-mass of a lepton to get the next heavy lepton in the chain electron, muon, 
tau,... , Barut got the following expression (where a ~ 1/137): 

o n=N 

m{N) = m e (l + -a' 1 ™ 4 ) ( 56 ) 

n=0 

The agreement with the data of this rather simple formula is surprisingly good, the 
discrepancy being of order 10~ 4 for the muon and 1CT 3 for the tau, respectively: 
see Ref. [48 1 or Table 1 . In order to get the masses of the quarks ll49l . it is enough 
to take for the mass of the lightest quark m u = m e /7.47 . Again, we see in Table 1 
that the agreement between the theoretical quark masses and the 'observed' masses 
is quite good, especially for the three heavy quarks [49]. 

Petiau's Hamiltonian d55t shows that the energy density of the solutions to his 
system (l54l) is a constant which depends only on the two parameters k and fi. If 
we assume, just like Lanczos did in 1929 [4], that the singular part of the field is 
contained in a small region of finite extension, the 'mass' of the singularity will 
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Table 1: Comparison of lepton and quark masses in MeV/c 2 calculated with 
Barut's formula (l56l) to measured masses from Ref. [48 1. The observation of a 
sixth quark of mass in the range of 160'000 to 190'000 MeV/c 2 has been reported 
at the beginning of 1995 iBTll . 



have the same form as (l55l) . Moreover, without going into the details of a specific 
model ll49l . we can expect — from the general principles of quantum mechanics 
— that if there is a spectrum of solutions, they will correspond to integer multiples 
of the proper frequency /i. Hence, for fixed k, the mass spectrum of elementary 
particles will correspond to a sum of partial contributions of the form 

m(n) = mo n A (57) 

Comparing with (l56l) . this expression establishes the link between Petiau's non- 
linear fields and Barut's formula for the mass of quark and electrons. If this is 
correct, what about the factor 7.47 ? 

There are two non-trivial exceptional cases for elliptic functions: the harmonic 
case, k = sin(7r/4), and the equianharmonic case, k = sin(7r/12). It is very 
plausible to associate the former with leptons, and the latter with quarks. Indeed, 
in either case, the corresponding elliptic functions exhibit several unique symmetry 
and scaling properties, which come from the fact that in the complex plane their 
poles form a modular aggregate with n/2 or 7r/3 symmetries. Since according to 
(l55l) the mass is proportional to k 2 , the lepton to quark mass ratio is then equal to 
about 7.47. 

A final very interesting feature of Petiau's nonlinear system (|54t is that it has 
two general classes of solutions to which correspond two plane waves limits P4l . 
In the first class, the third equation decouples and we get the standard Lanczos 
equation in which A and B are the two spinors of opposite chiralities which are 
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interchanged in a spatial reversal operation. The degenerate Petiau system is thus 
equivalent to the Lanczos or Lanczos-Einstein-Mayer equations from which we 
can derive two Dirac's equations, one for each isospin states. 

In the plane wave limit of the second class, one of the spinor fields A or B 
decouples: We are left with a lone spinor coupled to the scalar field E. It is then not 
possible to construct a parity covariant field with just one degenerate Petiau system. 
Hence, while the first class clearly corresponds to leptons (or more generally to 
particles such as the nucleons which have well defined spin ~ asymptotic states), 
we can interpret the second class as quark states. 'Confinement' is then explained 
by the above impossibility of the second class of solutions to have Dirac-like 
asymptotic plane wave limits. 

In order get hadrons, i.e., particles of definite parity made out of quarks, it is 
then necessary to make covariant superpositions of two or three quark states. In 
this picture, the quark states are defined as Lanczos fields and thus do not have 
a definite spin or parity. Hence, contrary to the Standard Model in which the 
quarks are Dirac fields, there is no contradiction with Pauli's exclusion principle. 
However, as a conclusion to these ideas, we stress that the details of such a theory 
of hadrons have not yet been worked out. 

17 The fundamental length and the elementary par- 
ticle mass spectrum 

It is not difficult to show that the N A power law dependence for the total energy 
which we have found for the Lanczos-Petiau's system <l54T> is a general property 
of all Lagrangian field theories in which there is a nonlinear term in the fourth 
power of the field amplitude. From a theoretical point of view, this is a very nice 
Lagrangian because it has many good properties. In particular, it is conformally 
invariant and renormalizable. From a phenomenological point of view, the N 4 
power law dependence is also very good. In effect, such a strong power law 
dependence is essential in order to explain the very big difference in mass between 
the lightest fundamental particle, the electron (m e c 2 = 0.5 MeV), and the fifth 
and sixth quarks (mc 2 ~ 5000 and ~ 175'000 MeV). Hence, we see that Petiau's 
equation is more than just a generalization of Lanczos-Einstein's system and that 
there is a broad theoretical and empirical basis for it, although it may be just a first 
approximation to the complete theory. 

If we now take a purely fundamental point of view, it is important to remark 
that many different theories could in principle lead to a Lagrangian with a quartic 
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term, and that in this respect Petiau's ideas are not necessarily the true ones. In 
particular, many nonlinear generalization of well established theories, such as 
gravitation ifSOll or electrodynamics iBTl . quite naturally lead to such Lagrangians. 
The problem with these generalizations is that the scale is then either the Planck 
length (in which case the nonlinear effects are very small), some fraction of the 
electron radius r e , or, as in the present model, some unknown parameter. 

Comparing expression (l57l) of the particle mass to the empirical formula d56l) . 
we find that (for the lepton spectrum) the constant m corresponds to the energy 
equivalent of the muon mass, i.e., about 105 MeV = |a _1 m e c 2 . This figure is 
very interesting because the typical excitation energies of hadrons are precisely 
of this order. Moreover, the mass of most elementary particles have values which 
are very close to integer or half-integer multiples of a~ l m e c 2 = 70 MeV ll52l . 
If we assume that the universal coupling constant of all interactions (except the 
gravitational one) of the elementary particles is defined by the electric charge 
quantum e, we derive from the formula r = e 2 /m c 2 the fundamental length 

r = ar e (58) 

where we have discarded the factor | because in the present discussion we are 
only interested in order of magnitude considerations. 

A fundamental length (or fundamental energy), expected to become the fourth 
fundamental constant, has been advocated by a number of authors and in particular 
by Heisenberg [f53l and Stiickelberg [54|. The problem, as we have already said, is 
to find its value and its correct interpretation. For Heisenberg and Stiickelberg, the 
fundamental length was estimated to be of the order of r e . From the mass spectrum 
of the electron and quarks, and the assumption that the relevant interaction constant 
was the electric charge, we have derived d58l . Let us give some further evidence 
that this is indeed the fundamental that explains the mass spectrum of elementary 
particles. 

The first quality of a fundamental length is that it must characterize the thresh- 
old for some new physics. In this respect, one must remember that because e 2 and 
he have the same dimensions (and thus a ratio a = e 2 / he which is pure number) 
one can always associate two different lengths to a given energy or mass. Hence, 
since the larger of these is associated with the wave-aspect and the shorter one 
with the particle-aspect of a massive object, each length has to be considered in the 
proper context. The electron mass, for example, gives the electron radius which is 
the distance from a point charge below which classical electrodynamics becomes 
inconsistent; on the other hand, the Compton wave-length of the electron is the 
threshold for electron-positron pair-creation, a typical phenomenon of relativistic 
quantum electrodynamics (QED). 
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Since QED is by far the most precise and best understood part of elementary 
particle theory, the threshold for 'new physics' in the context of QED is — as is well 
known — the phenomenon of muon pair-production. 32 This precisely corresponds 
to the energy that led to the fundamental length d58t . Hence, QED sets an upper 
limit to the fundamental length r , which therefore has to be at most of the order 
of r = ar e so that pairs of particles other than electrons may be created. 

In Sec. [161 in order to derive formula d57t . we have assumed that a space-time 
region of finite extension was associated with the particle. The size of this region, 
a kind of a bag which contains the energy corresponding to its mass, 33 is of course 
our fundamental length r . This concept can be used independently of nonlinear 
wave mechanics and Petiau waves, and leads to nonlinear models that are typical 
of those we referred to in the beginning of the present section (for an example, see 
Ref. J53J). But, whatever the details of the model, the size of this bag has to be 
consistent with quantum theory and with special relativity. This will give us two 
lower limits to the fundamental length. 

In quantum theory, a most fundamental principle is that a wave function has to 
be associated with every particle. In the rest frame of the particle, it has the form 
exp(imc 2 t / h) . If a finite region of space-time is also associated with the particle, 
compatibility requires that its size has to be smaller than the wave-length of the 
wave function. This means that r < h/mc, which corresponds to an approximate 
upper limit to the mass spectrum 

he 

mc 2 < — = a~ 2 m e c 2 ~ 9.6 GeV (59) 

A second constraint on the mass of a particle of finite extension is given by the 
minimum radius of a relativistic particle with spin. In effect, if we attribute an 
intrinsic angular momentum to a relativistic particle, its center of mass cannot be 
defined more precisely than the ratio of the spin angular-momentum to the particle 
mass [55|. Thus, to be compatible with this relativistic effect, i.e., in order that 
our lepton or quark can still be considered as a spin ~ point particle, we need 
r < h2mc, which gives an upper limit of about 5 GeV, i.e., of the same order a/s 
the above quantum criterion. 

32 By 'new physics' we do not mean a breakdown of QED, but phenomena such as the creation 
of pair of particles heavier then the electron, the properties of which are not explained by QED. 
For electromagnetic interactions, QED has been tested to extremely short distances, corresponding 
to energies of the order of 10 6 MeV. 

33 It is important to stress that this quantum mechanical 'bag' does not have to correspond to 
the extension of some directly measurable quantity, such as the electric charge or hadronic matter 
distribution in the bag model of hadrons. It is more like a volume of phase-space such as is 
introduced in field quantization. 
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Looking at Table 1, we see that there is no lepton or quark with a masselarger 
than a~ 2 m e = 9.6 GeV/c 2 . We therefore conclude that r = ar e is really the 
fundamental length we were looking for and that, as expected by Lanczos, it is 
neither the electron radius or the Planck length. 34 

Finally, we notice that r is essentially the electron radius times a, the mys- 
terious fine structure constant of Schrodinger. This is one more hint that r is 
indeed a fundamental length. In effect, while the electron radius can be seen as the 
simplest combination of elementary quantities giving a scale with the dimension 
of a length, the determination of alpha is now equivalent to deriving r from some 
more fundamental theory, unless it is assumed that r is truly fundamental, in 
which case alpha is nothing but the ratio of tq to r e . 35 

In the currently Standard Model of elementary particles, a sixth quark with 
a very large mass is needed for theoretical and phenomenological reasons. The 
discovery of this sixth quark could therefore be interpreted as an argument against 
the existence of a fundamental length such as required by the present interpretation 
of the Barut formula. The Standard Model, however, is basically a perturbative 
theory of the interactions of quark and leptons, a theory that is logically independent 
of our interpretation of mass quantization. In this perspective, the conjunction of 
our theory of mass with one of interactions could explain the need for a very 
massive sixth quark, as well as the existence of very light neutrinos, which are 
also predicted by our model [49]. 



18 Conclusion: The advent of nonlinear quantum 
theory 

Lanczos's idea of 1929 to derive Dirac's equation from a more fundamental one 
was a highly heretical concept. It implied a kind of Copernician revolution which 
was clearly not acceptable to the 'fathers of quantum mechanics.' Moreover, the 
essentially theoretical reasoning of Lanczos was too far from the experimental 

34 We remark that these considerations, and thus the mass limit ( I59t . applies only to quarks and 
leptons, i.e., to particles of 'matter,' and not necessarily to bosons such the W and Zq, which are 
quanta of effective interaction fields, and which have masses of 80 and 91 GeV, respectively. 

35 It should be mentioned that the fundamental significance of ar e is also suggested by classical 
models of particles with spin. For example, in the model of E. Groschwitz, the charge of the 
electron is supposed to be distributed over a sphere of radius ar e . The motion of the electron is 
then a complicated 'Zitterbewegung'-like trajectory which has two interesting limiting cases: one 
with an effective mass of the order of the electron mass, the other with an effective mass of the 
order of the muon mass. See pages 357-361 in H. Holn, Ergebinisse d. exakten Naturwiss., Bd. 
XXVI (1952) 291-382. 
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situation of the time to be confronted with the data. The fate of Lanczos's ideas 
is therefore somewhat similar to those of de Broglie when he proposed in 1927 
his double solution theory: nobody took serious interest in it before the concept 
was revived in the 1950's by the work of David Bohm on deterministic quantum 
theory. 

The philosophical lesson is that pure theoretical insight has much more difficul- 
ties gaining acceptance than does any kind of a more phenomenological approach. 
Because of the brilliant success, following World War II, of the young 'quantum 
engineers' with renormalizable quantum electrodynamics, the pragmatic approach 
is possibly even more in favor now than in Lanczos's days. The best example of 
this is the Standard Model, a highly successful effective phenomenology, which 
gives numerical agreement for a vast number of phenomena, but which neverthe- 
less does not answer most of the truly fundamental problems which are the real 
concern of theoretical physics. 

In the highly competitive context of contemporary research, it is sad that 
Lanczos never learned that he had discovered isospin in 1929. This is particularly 
regrettable since the final interpretation of his fundamental equation was given 
in 1957 by Giirsey, who therefore deserves a lot of credit for it, although in the 
papers which made him famous, Giirsey did not quote Lanczos. 36 Five years later, 
in 1962, Lanczos will again rediscover Dirac's equation, this time in the context 
of general relativity HB], although without the mass term. Again he will refer to 
quaternions and to his contention about their special role in physics: 

"The spontaneous and quite unexpected appearance of quaternion 
calculus and the Dirac equation (the fundamental building block of 
electromagnetism and of quantum theory) in the field equations of 
general relativity can hardly be considered as mere accidents" (56l 
p. 389]. 

Until the end, Lanczos continued the work on fundamental theory he had 
started in 1919 with his dissertation and that reached a climax with his review 
paper of 1 93 1 . He could never forget the theoretical insight he had gained with his 
quaternionic derivation of Dirac's equation, and the fact that this major contribution 
was ignored. 

36 It is also unfortunate that a number of researchers with an interest in quaternion techniques may 
have been turned away from Lanczos's work on Dirac's equation by a partially incorrect statement 
of A.W. Conway, who wrote: "Quaternions have been used by Lanczos (1929) to discuss a different 
form of wave equation, but here the Dirac form is discussed, the wave function being taken as a 
quaternion and the four-row matrices being linear functions of a quaternion." See, A.W. Conway, 
Quaternion treatment of the electron wave equation, Proc. Roy. Soc. A 162 (1937) 145-154. 
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But the contemptuous attitude of Lanczos's contemporaries is not a proof of 
the superiority of the dominant ideas of the time. Neither is the imperfection 
of Lanczos's formulation of 1929 a sufficient reason for rejecting his ideas and 
excluding him from the small circle of the leading physicists who were to put 
the dogma of the new physics into final shape. It is therefore easy to understand 
the feelings of Lanczos when he criticized the intolerant positivist attitude of the 
quantum physicists, or announced the end of quantum theory with the advent of a 
more fundamental nonlinear one 0. 
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19 Appendix: Comparison of Dirac-Lanczos 's and 
Dirac's equations 



Let us compare the Dirac-Lanczos equation (TBI) : 

VD = mD*iu. (14') 

to the Dirac equation: 

= mV, (60) 

Two representations of the Dirac equation such that the two quadruplets of 
gamma-matrices are related by 7^ = Sj^S -1 , where S = S + E SU(A) is unitary, 
are equivalent as the far as the physical consequences of the theory are concerned. 37 

In the Dirac-Lanczos equation, the arbitrariness in the choice of the 'Dirac 
matrices' 7^ corresponds to the freedom of taking any arbitrary unit vector z7 as a 
factor on the right hand side. Therefore, two representations of Lanczos's equation 
such that V = QuQ' 1 , where Q = Q + E SU (2) is unitary, are equivalent. 

Consequently, in Dirac's formalism, the arbitrariness in the representation 
corresponds to a 1 5-parameter group, and in Lanczos's formalism to a 3-parameter 
group. 

In Dirac's formulation the 4-complex-component electron field is taken as a 
4x1 column vector ^, and the linear operators are 4 x 4 complex matrices. 

In Lanczos's formulation the same 4-complex-component field is a biquater- 
nion D E M = M 2 (C) = C£ 1>2 = C£ 3>0 . The linear operators are then linear 
biquaternions functions of biquaternions, which are isomorphic to the algebra of 
4x4 complex matrices M 4 (C) = M 2 (B) = C£ 4 ,i. 

In both formulations the operator space has 4 x 4 x 2 = 32 dimensions over 
the reals. The difference is that in the Dirac formulation the field is an abstract 
4-component column vector, while in the Lanczos formulation the field is directly 
related to the algebraic structure of spacetime because any biquaternion D = s + v 
is the direct sum of a scalar s and a 3-component vector v. 

Lanczos's formulation is therefore more suitable than Dirac's for studying 
and demonstrating the "classical" aspects of the electron field, and for making 
comparisons with the Maxwell and Proca fields which are usually expressed in 
terms of scalars and vectors l!53*l . 

37 See, e.g., J.J. Sakurai, Advanced Quantum Mechanics (Addison Wesley, 1987) section 3-2. 
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Finally, in terms of Clifford algebras, the Dirac field ^ is a degenerate 8-real- 
component element of the 32-dimensional Clifford algebra C£ 4j i (i.e., an element 
of an ideal of that algebra) while the Lanczos field D is any 8-real-component 
element of the 8-dimensional Clifford algebra Cl\p = B, which is therefore the 
smallest algebra in which Dirac's electron theory can be fully expressed. 

For more details see Refs. ifTTl l33l l62l l63l l64ll . where it is shown that the 
complex conjugation operation appearing on the right-hand side of Eq. (1141 is 
characteristic of the fermionic character of the Dirac field, in contradistinction to 
the Maxwell and Proca fields which are bosonic fields ll63i 
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